Dual-Mode AVQ Coding Based on Spectral Masking and Sparseness Detection for ITU-T G.711.1/G.722 Super-Wideband Extensions
نویسندگان
چکیده
ITU-T Recommendations G.711.1 Annex D and G.722 Annex B, which are super-wideband (50–14,000 Hz) extensions to G.711.1 and G.722, have been recently standardized. This paper introduces a new coding method proposed and employed in the above ITU-T standards. The proposed coding method employs an adaptive spectral masking of the algebraic vector quantization (AVQ) for MDCT-domain non-sparse signals. The adaptive spectral masking is switched on and off based on MDCT-domain sparseness analysis. When the target MDCT coefficients are categorized as non-sparse, masking level of the target MDCT coefficients is adaptively controlled using spectral envelope information. The performance of the proposed method as a part of the ITU-T G.711.1 Annex D is evaluated in comparison with the ordinary AVQ. Subjective listening test results show that the proposed method improves the sound quality more than 0.1 points with a five grade scale in average of speech, music and mixed content, and the significance of the improvement is validated.
منابع مشابه
On the cost of backward compatibility for communication codecs
Super wideband (SWB) communication calls more and more attention as can be seen by the standardization activities of SWB extensions for well-established wideband codecs, e.g. G.722 or G.711.1. This paper presents a technical solution for extending the G.722 codec and compares the new technology to other standardized SWB codecs. Hereby, a closer look is given on the concept of extending technolo...
متن کاملTree Encoding for the ITU-T G.711.1 Speech Coder
This paper examines enhancement to ITU-T Recommendation G.711.1 PCM wideband extension speech coder. To further improve the core lower-band coding performance the use of vector quantization and delayed decision coding is studied. A particular case of delayed decision coding, tree encoding, is implemented in the above standard. The bitstream is compatible with both the legacy G.711 and the G.711...
متن کاملA PCM coding noise reduction for ITU-t g.711.1
The ITU-T G.711.1 embedded wideband speech codec was approved by ITU-T in March 2008. This codec generates a bitstream comprised of three layers: a G.711 compatible core layer with noise shaping, a lower band enhancement layer and an MDCT-based higher band enhancement layer. It contains also an optional post-processing module called Appendix I designed to improve quality of the decoded speech i...
متن کاملSuper-Wideband Bandwidth Extension for Wideband Audio Codecs Using Switched Spectral Replication and Pitch Synthesis
This paper describes a new bandwidth extension algorithm which is targeted at high quality audio communication over IP networks. The algorithm is part of the Huawei/ETRI candidate for the ITU-T super-wideband (SWB) extensions of Rec. G.729.1 and G.718. In the SWB candidate codec, the 7-14 kHz frequency band of speech and audio signals is represented in terms of temporal and spectral envelopes. ...
متن کاملQuantifying wideband speech codec degradations via impairment factors: the new ITU-t p.834.1 methodology and its application to the g.711.1 codec
Wideband speech codecs usually provide better perceptual speech quality than their narrowband counterparts, but they still degrade quality compared to an uncoded transmission path. In order to quantify these degradations, a new methodology is presented which derives a one-dimensional quality index on the basis of instrumental measurements. This index can be used to rank different wideband speec...
متن کامل